GENERAL DESCRIPTION 

The SC-01 Speech Synthesizer is a completely self-contained 
solid state device. This single chip phonetically synthesizes 
continuous speech, of unlimited vocabulary, from low data 
rate inputs. See Figure 1. 

Speech is synthesized by combining phonemes (the building 
blocks of speech) in the appropriate sequence. The SC-01 
Speech Synthesizer contains 64 different phonemes which are 
accessed by a 6-bit code. It is the proper sequential 
combination of these phoneme codes that creates continuous 
speech. 

The SC-01 Speech Synthesizer is cost-effective, consumes 
minimal power and enables in-house product development 
without vendor dependency. Signals from the SC-01 are 
applied to an audio output device to amplify and distribute 
the synthesized speech. See Figure 2. 




Figure 1. Votrax® SC-01 Speech Synthesizer 



FEATURES 

• Single CMOS chip 
• 70 bits per second 
• 22 pin package 
• 9 mA current drain 
• Wide voltage supply range 
• Latched 5 V compatible inputs 
• Digital pitch level inputs 
• Automatic inflection 
• On-chip master clock circuit 
• Optional external master clock 
• Variety of voice effects 
• Sound effects 
• Customer product security 

The design of the equipment specified herein is proprietary. 
Rights for the reproduction and distribution of the data contained 
herein are granted except for the manufacture and reproduction 
of the subject equipment. 



Figure 2. SC-01 Flow Diagram 



Votrax® reserves the right to alter its product line at any time, or change specifications or design without notice and without 
obligation. 

PHYSICAL DESCRIPTION 

The SC-01 Speech Synthesizer is a 22 pin Large Scale Integrated 
Circuit which contains all the circuitry necessary to generate 
phonetically synthesized speech. The SC-01 is fabricated using 
CMOS technology, which offers high input impedance and low 
power drain. 

ELECTRICAL DESCRIPTION 

The SC-01 Speech Synthesizer is program-compatible with 
existing Votrax® phoneme synthesizers. It requires 70 bits of 
data per second for continuous speech production. The 6-bit 
phoneme codes are 5 volt logic compatible and are latched for 
data bus applications. A phoneme-construction algorithm and 
filters, within the chip, create the synthesized audio output. 

PHONEME DESCRIPTION 

Table 1 lists the 64 phonemes produced by the SC-01. Each 
phoneme code is accompanied by its symbol, average duration 
time, and an example word. The underlined segments of the 
example word demonstrate the phoneme use, i.e., the sound to be 
pronounced. 

Table 2 subdivides the 64 phoneme symbols into seven 
categories. Each category represents a different production 
feature. The first six categories are characterized by voiced, 
fricative (expired voice), and nasal sounds. The seventh 
category is characterized by phonemes with no sound output. 
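
The 70 bits per second figure quoted in the electrical description can be related to the 6-bit phoneme code with simple arithmetic. The sketch below assumes the rate is an average over continuous speech and that each phoneme costs exactly one 6-bit code; the constant names are illustrative and do not come from the data sheet.

```python
BITS_PER_SECOND = 70     # data rate quoted for continuous speech
BITS_PER_PHONEME = 6     # width of one phoneme code (P0-P5)

# Number of distinct codes a 6-bit field can select.
phoneme_count = 2 ** BITS_PER_PHONEME

# Average phoneme rate and duration implied by the quoted data rate.
phonemes_per_second = BITS_PER_SECOND / BITS_PER_PHONEME
avg_phoneme_ms = 1000.0 / phonemes_per_second

print(phoneme_count)                  # 64
print(round(phonemes_per_second, 2))  # 11.67
print(round(avg_phoneme_ms, 1))       # 85.7
```

Under these assumptions the implied average of roughly 86 ms per phoneme falls inside the 47 to 250 ms phoneme duration range given in Table 3.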

PHONEME PROGRAMMING 

Manual Operations: Votrax® maintains a library of 
phonetically programmed words. Reference to this library and 
programming manuals will aid in word synthesis. 



Automatic Operations: Votrax® can supply a microcomputer 
system for automatic conversion of English text into phoneme 
sequences. This system is particularly useful for in-house 
vocabulary development and product security. Contact 
Votrax® for further information. 

PHYSICAL DIMENSIONS 

Figure 3. SC-01 Footprint and Outline Dimensions 

Table 1. Phoneme Chart 

Phoneme   Phoneme   Duration   Example 
Code      Symbol    (ms)       Word 

00        EH3       59         jacket 
          EH2       71 
          EH1 

/T/ must precede /CH/ to produce CH sound. 
/D/ must precede /J/ to produce J sound. 

Table 2. Phoneme Categories According to Production Features 

                               'Voiced'   'Voiced'  Fricative 
Voiced                         Fricative  Stop      Stop       Fricative  Nasal  No Sound 

E    EH    AE    UH    OO1     Z          B         T          S          M      PA0 
E1   EH1   AE1   UH1   R       ZH         D         DT         SH         N      PA1 
Y    EH2   AH    UH2   ER      J          G         K          CH         NG     STOP 
Y1   EH3   AH1   UH3   L       V                    P          TH 
I    A     AH2   O     IU      THV                             F 
I1   A1    AW    O1    U                                       H 
I2   A2    AW1   O2    U1 
I3   AY    AW2   OO    W 

Copyright Votrax® 1980 

SIGNAL DESCRIPTION (See Figures 4 and 5) 

Phoneme 6-Bit Selection Code (P0-P5): Data is input on six 
pins. Latching is controlled by the strobe (STB) signal. 

Strobe (STB): Latching occurs on the rising edge of the strobe 
signal. 

Inflection Level Setting (I1, I2): Instantaneously sets pitch level 
of voiced phonemes. 

Acknowledge/Request (A/R): Acknowledges receipt of 
phoneme data (signal goes from high to low one master clock 
cycle following active edge of STB signal). Also indicates 
timing out of old phoneme concurrent with request for new 
phoneme data (signal goes from low to high). 

NOTE 

If external phoneme timing is desired, phoneme 
requests can be ignored. However, best speech is 
realized with internal timing. 

Master Clock Resistor-Capacitor (MCRC): This input 
determines the internal master clock frequency. Select R-C 
values for 720 kHz to achieve standard phoneme timing. 
Connect this input to MCX when using the internal clock; 
ground it when using an external clock. 

NOTE 

Varying clock frequency varies voice and sound 
effects. As clock frequency decreases, audio 
frequency decreases and phoneme timing 
lengthens. Figures 6 and 7 illustrate manual and 
DAC (Digital to Analog Converter) voice variation 
schematics, respectively. 

Master Clock External (MCX): Allows control by an external 
clock signal. 

NOTE 

Ground MCRC during MCX operation. 

Audio Output (AO): Supplies analog signal to audio output 
device. 

Audio Feedback (AF): Used with Class A or Class B transistor 
audio amplifiers for added stability. 

Class B (CB): Current source for Class B transistor audio 
amplifier. 
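
The MCRC and clock-variation notes in this section can be made concrete with a little arithmetic. The sketch below assumes the approximate relation f ≅ 1.25 / (R x C) given in the footnote to the electrical characteristics, and further assumes that phoneme timing scales inversely with clock frequency (the text states only that timing lengthens as the clock slows). The function names and the 6.5k resistor example are illustrative.

```python
STANDARD_CLOCK_HZ = 720_000.0   # frequency quoted for standard phoneme timing

def rc_clock_hz(r_ohms, c_farads):
    """Approximate master clock frequency of the MCRC network: 1.25 / (R*C)."""
    return 1.25 / (r_ohms * c_farads)

def capacitor_for(freq_hz, r_ohms):
    """Capacitance in farads that gives freq_hz with r_ohms, same relation."""
    return 1.25 / (freq_hz * r_ohms)

def scaled_duration_ms(duration_at_720k_ms, clock_hz):
    """Assumed inverse scaling of phoneme duration with master clock frequency."""
    return duration_at_720k_ms * STANDARD_CLOCK_HZ / clock_hz

c = capacitor_for(STANDARD_CLOCK_HZ, 6_500.0)
print(round(c * 1e12))                                # 267 (picofarads)
print(round(scaled_duration_ms(107, 600_000.0), 1))   # 128.4
```

With a 6.5k resistor the relation calls for roughly 267 pF at 720 kHz. Slowing the clock to 600 kHz would, under the scaling assumption, stretch a 107 ms phoneme to about 128 ms.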



Table 3. Timing Specifications 

CHARACTERISTIC                              SYMBOL   MIN    TYP    MAX    UNIT 

Input Setup Time (Pi to STB)                TS       450                  ns 
Input Hold Time (Pi to STB)                 TH                            ns 
Rise Time of STB Edge (.8V to 4V)           TRS                    100    ns 
A/R Width (A/R Connected to STB) †          TARW     1      1.3    2      µs 
STB Width                                   TSW      200                  ns 
STB Low *                                   TSL      *                    ns 
Propagation Delay (STB to A/R after TARW)   TDAR                   500    ns 
A/R Rise Time (Capacitive load = 30 pF)     TRAR                   100    ns 
A/R Fall Time (Capacitive load = 30 pF)     TFAR                   100    ns 
Time from A/R Request to STB Service        TARS                   500    µs 
Time of Phoneme Duration †                  TPH      47     107    250    ms 

† Dependent on Master Clock frequency: 720 kHz 

* Strobe must remain low (72 x Master Clock Period) before rising edge 
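
The STB Low footnote can be turned into a concrete number. The sketch below evaluates 72 master clock periods, using the standard 720 kHz clock as the default; the helper name and the 600 kHz comparison value are illustrative.

```python
def strobe_low_min_us(master_clock_hz=720_000.0, periods=72):
    """Minimum time STB must stay low before its rising edge, in microseconds."""
    return periods / master_clock_hz * 1e6

print(round(strobe_low_min_us(), 3))            # 100.0
print(round(strobe_low_min_us(600_000.0), 3))   # 120.0
```

At the standard clock the strobe must therefore be held low for 100 µs, and the requirement grows as the master clock is slowed.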

Figure 4. Timing Diagram 

Figure 5. SC-01 Block Diagram 

Figure 6. Variable Voice by Potentiometer Control 

Data at address must be a pause phoneme code. 

Figure 7. Variable Voice by DAC Current Injection 

TYPICAL APPLICATIONS 

General: The SC-01 Speech Synthesizer is easily designed into 
systems ranging in complexity from ROM/counters to 
microprocessor controllers. 

Single Message System: See Figure 8. When the counter is 
released (START is TRUE), the message is clocked out of the ROM 
by the A/R signal. The system must be stopped when DONE is 
TRUE. Note: When using A/R tied to STB, connect a .01 µF 
capacitor to TP3 to ensure power-up reset of the SC-01. 

Figure 8. Single Message System 

Multiple Message, Fixed Block Size: See Figure 9. The message 
address block is loaded into the counter. The message is then 
clocked out of the ROM by the A/R signal. 

NOTE 

Message Block = 2^n maximum. 

Figure 9. Multiple Message, Fixed Block Size 

Multiple Message, Variable Block Size: See Figure 10. The 
microprocessor loads phonemes onto a data bus. The A/R 
signal generates an interrupt request for each new phoneme. 

CONNECTING THE AUDIO OUTPUT DEVICE 

Audio Output: The AO signal has a maximum peak-to-peak 
voltage swing of .26 times Vp, depending upon the phoneme 
selected, and the AO signal is D.C. biased. 

Class A Amplifier: See Figure 11. For a single transistor 
amplifier, the selection of R, C, or Rg values depends upon the 
value of Vp and the desired audio level. 

Class B Amplifier: See Figure 12. A current source (CB) is 
required for this push-pull amplifier. 

NOTE 



Minimum power is consumed when speech is 
inactive. When Vp = +12.0 volts and Rg = 40 
ohms, the bias current drain is approximately 3.5 
milliamps. 

Controlling Audio Output Power: See Figure 13. A resistor or 
potentiometer from the speaker to ground can be used to 
control the audio output power. 

Figure 10. Multiple Message, Variable Block Size 

Figure 11. Class A Amplifier 



Figure 12. Class B Amplifier 



Figure 13. Controlling Audio Output Power 



Table 4. Analog Output Specifications 

CHARACTERISTIC                           MIN        MAX        UNIT 

Output Voltage (AH Phoneme)              .18 x Vp   .26 x Vp   Vp-p 
Output Bias Current ** (.6V < CB < Vp)   3.5        7.3        mA 

** For Class B Amplifier: (β) x (Rg min.) = 81.6 x (Vp), where β is the beta or current gain of the transistor. The AO line is protected by an 
internal series current limiting resistor of 90 ohms maximum. If more current is required of the SC-01, then the above formula 
indicates distortion will occur. 
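
The Class B relation and the Table 4 output voltage limits are easy to evaluate for a given supply. The sketch below assumes Rg is expressed in ohms (as in the bias current note for the Class B amplifier) and uses a transistor beta of 100 purely as an example; both function names are illustrative.

```python
def rg_min_ohms(vp_volts, beta):
    """Minimum Rg from the Class B relation beta * Rg(min) = 81.6 * Vp."""
    return 81.6 * vp_volts / beta

def output_swing_vpp(vp_volts):
    """(min, max) peak-to-peak AO swing for the AH phoneme: .18*Vp to .26*Vp."""
    return 0.18 * vp_volts, 0.26 * vp_volts

print(round(rg_min_ohms(12.0, 100), 2))               # 9.79
print([round(v, 2) for v in output_swing_vpp(12.0)])  # [2.16, 3.12]
```

For a 12 V supply this gives an AO swing between about 2.16 and 3.12 volts peak to peak, and a lower bound on Rg of roughly 9.8 ohms for the assumed beta.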



ELECTRICAL CHARACTERISTICS: TA = 0 to 70°C, Vp = 10 VDC to 14 VDC 

CHARACTERISTIC                                   MIN        TYP    MAX        UNIT 

Digital Input Impedance                          1 meg.                       Ohm 
Input Capacitance (Pi, STB)                                        3          pF 
Input Capacitance (I1, I2, MCX)                                    8          pF 
Digital Input Logic "0" (except I1, I2, MCX)     VG - 0.5          VG + 0.8   Vdc 
Digital Input Logic "0" (MCX)                                      VG + 1.0   Vdc 
Digital Input Logic "0" (I1, I2)                                   .2 x Vp    Vdc 
Digital Input Logic "1" (except I1, I2, MCX)     VG + 4.0          Vp + 0.5   Vdc 
Digital Input Logic "1" (I1, I2)                 .8 x Vp                      Vdc 
Digital Input Logic "1" (MCX)                    4.6                          Vdc 
Digital Output Logic "0" (I sink = (b.SmA)                         VG + 0.5   Vdc 
Digital Output Logic "1" (I source = 0.5mA)      Vp - 0.5                     Vdc 
Power Supply Current   Vp = 9V                              9.1               mA 
                       Vp = 9V **                           11     18         mA 
                       Vp = 14V **                          18     27         mA 
* Master Clock Frequency                                    720K              Hz 
MCX Input Duty Cycle                             60:40             40:60      % 
Master Clock Resistor Value (MCRC) ***           6.5k                         Ohm 
Master Clock Capacitor Value (MCRC) ***                            sm         pF 

* Variable 
** With CB, AF, AO connected for Class B audio amplifier (see APPLICATION NOTES) 
*** Frequency of Master Clock ≅ 1.25 / RC 
Note: TP1, TP2 must be left open for normal operation. 

Table 5. Absolute Maximum Ratings 

ABSOLUTE MAXIMUM RATINGS * 

RATING                                  SYMBOL   VALUE             UNIT 

Power Supply Voltage                    Vp       20                Vdc 
Power Dissipation at 25°C               PDM      650               mW 
Derating Above 25°C                              5                 mW/°C 
Operating Ambient Temperature           TA       0 to 70           °C 
Storage Temperature                     TSTG     -55 to 125        °C 
Input Voltage                           VINM     -0.5 to Vp+0.5    Vdc 
DC Current Max. Above Vp+0.5V           IINM     1.0               mA 
Lead Temperature (soldering 10 sec)     TL       300               °C 

* Operation above these limits could damage the device. 

NORMAL OPERATING CONDITIONS: 10 V ≤ Vp ≤ 14 V, 0°C ≤ TA ≤ 70°C 
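
The power dissipation and derating entries in Table 5 combine into a simple limit at any ambient temperature. The sketch below assumes the usual linear reading of a derating figure (650 mW up to 25°C, then 5 mW less per degree); the function name is illustrative.

```python
def max_dissipation_mw(ambient_c):
    """Allowed dissipation: 650 mW at or below 25 C, less 5 mW per degree above."""
    if ambient_c <= 25.0:
        return 650.0
    return 650.0 - 5.0 * (ambient_c - 25.0)

print(max_dissipation_mw(25.0))   # 650.0
print(max_dissipation_mw(70.0))   # 425.0
```

At the 70°C upper end of the operating range the allowed dissipation under this reading is 425 mW.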
Figure 8. Typical Application 

See Note 2, Table 6 

Figure 9. Low power, multiple message, 8 byte block size. 

NOTE*: Circuit uses A/R line as strobe. R-C provides an initial 
strobe to ensure power-up reset of SC-01 A/R output. 

CIRCUIT DESCRIPTION 

The following is a brief functional description of the pin-outs on the 
SPEECH-PAC™ (see Schematic Figure 2). For a more detailed explanation of 
their operation and interaction, refer to SYSTEM OPERATION. 

EDGE CARD PARALLEL INTERFACE (J1): 

PINS              FUNCTION 

1,2,3,4           Power and Ground connections. Note that power 
                  may be supplied through either the Edge 
                  Connector (J1) or the Post Connector (J2). 

12                SEL: controls the mode of operation (1 or 2) 
                  of the SPEECH-PAC. 

15                SEL (inverted): the inverted state of SEL at 
                  any time. 

6                 LOAD: pulse input used during Mode 1 operation 
                  to load the desired ROM Address. 

7                 EOM: used during Mode 1 operation to indicate 
                  when the last phoneme of the requested word/phrase 
                  has been loaded into the SC-01 Speech Chip. 

A,B,C,D,E,F       P0-P5: during Mode 2 operation, these lines are 
                  loaded with the desired phoneme code. 

A,B,C,D,E,F,H,J   L0-L7: during Mode 1 operation, these lines are 
                  loaded with the ROM Address of the desired 
                  word/phrase. 

14                L8: this pin is provided to facilitate expansion 
                  of the system from a 16K ROM (2716) to a 32K ROM 
                  (2532). This expansion is easily accomplished 
                  by: 

                  a) Breaking the connection between Jumpers 
                     2 and 3. 

                  b) Making a connection between Jumpers 1 and 2. 

                  c) Replacing the current 2716 ROM with a 
                     2532 ROM. 

                  From this point on, Pin 14 (L8) would be 
                  utilized as the MSB of the ROM Address. 

K,L               I1, I2: used during either Mode 1 or Mode 2 
                  to set the inflection level of the SC-01 Speech 
                  Chip. 

9                 A/R: used during Mode 2 operation to determine 
                  when the SC-01 Speech Chip is ready for another 
                  phoneme. 

10                STB: used during Mode 2 operation to load the 
                  phoneme present on P0-P5 into the SC-01 Speech 
                  Chip. 

Note: all inputs are one TTL load. 

POST CONNECTOR (J2): 

PINS              FUNCTION 

4,5,6,7           Power and Ground connections. Note that 
                  power may be supplied through either the 
                  Edge Connector (J1) or the Post Connector (J2). 

1                 AUDIO OUT: output signal from the Audio 
                  Amplifier. 

2                 AUDIO GND: Ground level for the Audio 
                  Amplifier. 

8                 MCKIN: provided to facilitate the utilization 
                  of an external clock by the user. This may be 
                  accomplished by: 

                  a) Breaking the connection between 
                     Jumpers 4 and 5. (Disables the 
                     internal clock.) 

                  b) Making a connection between Jumpers 
                     5 and 6, and a second connection between 
                     Jumpers 4 and 7. 

                  From this point on, the clock signal present 
                  on Pin 8 (MCKIN) will be used to drive the system. 

SYSTEM OPERATION 

The following is a description of the two modes of operation available on 
the SPEECH-PAC: 

MODE 1 (WORD MODE) 

This mode is utilized to access words or phrases pre-stored in the ROM. 
The sequence of events involved is as follows: 

a) SEL is driven low to select Mode 1. This de-selects the phoneme 
   lines (P0-P5) by disabling U6 (see Schematic Diagram #1), and 
   enabling EPROM U7. 

b) The address of the desired word or phrase is placed on ROM 
   Address load lines L0-L7. Refer to Table 4 for a complete 
   list of the available words/phrases and their corresponding 
   addresses. Note that when loading the address, L0 is the least 
   significant bit. 

c) The LOAD line is driven high. This loads the address into the 
   ROM JAM COUNTERS (U2, U3, U4) and thereby to the ROM itself. 
   Note: for proper operation, the data on lines L0-L7 must be 
   valid for a period of 30 nsecs. prior to the LOAD line going 
   high and while the LOAD line is high. Following this, the LOAD 
   line must be driven low; otherwise the ROM JAM COUNTERS will 
   not be updated, and the SPEECH-PAC will remain "stuck" on one 
   phoneme. 

d) The phoneme stored in ROM at the presented address is output 
   and loaded into the SC-01 Speech Chip. Data lines D6 and D7 on 
   the ROM (U7) are high, thereby allowing incrementation of 
   the ROM JAM COUNTERS to the next address and setting up the next 
   phoneme. When the SC-01 (U11) is ready for the next phoneme, it 
   raises A/R (Pin 8, U11) high. Since, in this mode, A/R is 
   connected to STB (Pin 7, U11), STB is also driven high, 
   thereby loading the phoneme into the SC-01. A/R then returns 
   low, driving STB back low. It is this pulse which increments 
   the ROM JAM COUNTERS (U2, U3, and U4). 

e) This sequence is repeated until the last phoneme of the word/phrase 
   is output from the ROM. In this phoneme, Bit #6 (D6, U7) is low. 
   This signal clears the ROM JAM COUNTERS, thereby resetting the 
   address presented to the ROM. This reset address accesses a phoneme 
   in the ROM in which Bit #7 (D7, U7) is low. This signal in effect 
   disables the ROM JAM COUNTERS and drives line EOM low, thereby 
   indicating the end of the sequence to the user. The last phoneme is 
   still being vocalized via the SC-01 at this time. Therefore, if the 
   user wishes to output another word/phrase immediately after 
   completion of the current word/phrase, simply return to Step 'b' 
   immediately and start again. If a pause is desired between the 
   current word/phrase and the next, simply wait the desired length of 
   time and then return to Step 'b'. 
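
The Mode 1 sequencing in steps 'd' and 'e' can be modeled in a few lines. The sketch below is a simplified software model, not the actual hardware: it assumes the 6-bit phoneme code sits in ROM data bits D0-D5, that clearing the counters means returning to address 0, and that the ROM contents shown are arbitrary placeholder values rather than real vocabulary.

```python
PHONEME_MASK = 0x3F   # assumed: D0-D5 carry the 6-bit phoneme code
D6 = 0x40             # low on the last phoneme of a word/phrase
D7 = 0x80             # low only at the reset address (end of message)

def play_word(rom, start):
    """Return the phoneme codes clocked out for the word starting at `start`."""
    spoken = []
    addr = start
    while True:
        byte = rom[addr]
        if not byte & D7:
            # Reset-address entry: counters are disabled and EOM is asserted.
            break
        spoken.append(byte & PHONEME_MASK)
        if not byte & D6:
            addr = 0          # last phoneme: counters cleared (assumed address 0)
        else:
            addr += 1         # D6 and D7 high: counters increment
    return spoken

# Placeholder ROM: address 0 is the reset entry, a word starts at address 1.
rom = [0x43, 0xC1, 0xC2, 0x85]
print(play_word(rom, 1))      # [1, 2, 5]
```

The model stops at the reset entry and reports only the phonemes of the requested word, whereas the hardware holds at the reset address until the next LOAD.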

MODE 2 (PHONEME MODE) 

In this mode, the user may construct his own words, phrases, or sound effects 
by loading phonemes directly to the SC-01 Speech Chip. The sequence of 
events involved is as follows: 

a) SEL line is driven high to select Mode 2 operation. This selects the 
   phoneme lines P0-P5 by enabling U6. This also disables U7 and the 
   hardware by which the STB signal (Pin 7, U11) is controlled by the 
   A/R signal (Pin 8, U11). This permits complete control of the SC-01 
   Speech Chip by the user. 

b) The desired phoneme code is placed on phoneme lines P0-P5. Refer to 
   Tables 1 & 2 for a complete list of valid phonemes and their 
   corresponding codes. Note that P0 is the least significant bit when 
   strobing in phoneme codes. 

c) A test is then made to determine the state of the A/R line. When 
   this line is high, it indicates that the SC-01 Speech Chip is ready 
   to receive a phoneme. Once this condition has been detected, the 
   STB line should be driven high to strobe the phoneme code present 
   on lines P0-P5 into the SC-01. The STB line should then be driven 
   low again, after which the A/R line will go low until the SC-01 is 
   ready for the next phoneme. This process is repeated until the 
   entire phoneme sequence desired has been output. 

NOTE: If desired, the SC-01 can be configured to control itself in 
      terms of loading phonemes. This is easily accomplished by 
      connecting the A/R line to the STB line. In this method, the 
      A/R line is monitored and, each time this line goes low, the 
      next phoneme is placed on lines P0-P5. When the SC-01 is 
      ready for the next phoneme, it raises the A/R line, thereby 
      raising the STB line and loading the phoneme. The A/R line 
      is then driven low by the SC-01, which also lowers the STB 
      line. 

NOTE: For complete timing details for either method of control, 
      refer to Table 3 (Timing Specifications) and Figure 3 (Timing 
      Diagram). 
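
The Mode 2 handshake in steps 'b' and 'c' amounts to a short polling loop. The sketch below runs against a hypothetical in-memory stand-in for the port lines (`FakeSpeechPac` and its methods are invented for illustration and are not a real driver interface). A real host would also have to honor the setup, strobe width, and strobe low times in Table 3.

```python
class FakeSpeechPac:
    """Hypothetical stand-in for the SPEECH-PAC port lines (simulation only)."""

    def __init__(self):
        self.stb = False
        self.lines = 0
        self.busy_polls = 0      # polls remaining before A/R reads high again
        self.loaded = []         # phoneme codes latched so far

    def set_phoneme_lines(self, code):
        self.lines = code & 0x3F             # only P0-P5 exist

    def set_stb(self, level):
        if level and not self.stb:
            self.loaded.append(self.lines)   # rising edge latches P0-P5
        if not level and self.stb:
            self.busy_polls = 2              # model A/R going low while speaking
        self.stb = level

    def read_ar(self):
        if self.busy_polls > 0:
            self.busy_polls -= 1
            return False
        return True

def speak(port, codes):
    """Load each phoneme code using the A/R poll and STB pulse of Mode 2."""
    for code in codes:
        port.set_phoneme_lines(code)     # step b: place code on P0-P5
        while not port.read_ar():        # step c: wait for A/R high
            pass
        port.set_stb(True)               # rising edge strobes the code in
        port.set_stb(False)              # return STB low

port = FakeSpeechPac()
speak(port, [0x00, 0x01, 0x02])          # placeholder codes
print(port.loaded)                       # [0, 1, 2]
```

Polling keeps the example short. The interrupt-per-phoneme arrangement described for the variable block size system would replace the `while` loop with a handler triggered by A/R.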

INFLECTION: 

During normal operation it is probably desirable that the two inflection 
inputs to the SC-01 Speech Chip both be held low. However, these inputs may 
be altered at any time by the user. This is accomplished by placing the 
desired inflection code on lines I1 and I2, and can be done in either Mode 1 
or Mode 2. When a phoneme code is strobed into the SC-01, the new inflection 
code will also be latched. This inflection will remain in effect until the 
states of the I1, I2 lines are altered. 

CONTROLS: 



There are two external controls available on the SPEECH-PAC. These are: 

a) Volume Control: This is the knurled wheel (P1) located next to 
   the Post Connector (see Figure 1). It is used to control the 
   volume level of the output speech. 

b) Master Clock Frequency: This is the adjustable pot (P2) located 
   near the center of the SPEECH-PAC near the EPROM (see Figure 1). 
   Altering the setting of this control alters the frequency of the 
   Master Clock. This, in turn, alters the rate and pitch of the 
   output speech. 

NOTE: On initial power up only, it might be necessary to start in 
      Mode 2 and strobe the 'STB' line once before proceeding to Mode 1. 


